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Abstract: An explicit upper bound on the tail probabilities for the nor- 
malized Rademacher sums is given. This bound, which is best possible in a 
certain sense, is asymptotically equivalent to the corresponding tail prob- 
ability of the standard normal distribution, thus affirming a longstanding 
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1. Introduction, summary, and discussion 

Let £!,...,£„ be independent Rademacher random variables (r.v.'s), so that 
P{ei = 1) = P{ei = —1) = 2 for all i. Let ai, . . . ,a„ be any real numbers such 
that 

al + --- + al^l. (1.1) 

Let 

Sn '■= Ol^l + ■ ■ ■ + CLnSn 
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be the corresponding normalized Rademacher sum. Let Z denote a standard 
normal r.v., with the density function (p, so that ip{x) ^ -i= e^^ " for all real 

X. 

Upper bounds on the tail probabilities P(S'„ ^ x) have been of interest in 
combinatorics/optimization/operations research; see e.g. [3, 4, 17, 18, 28, 34] 
and bibliography therein. Other authors, including Bennett [5], Hoeffding [32], 
and Efron [23], were mainly interested in applications in statistics. The present 
paper too was motivated in part by statistical applications in [75]. 

A particular case of a well-known result by Hoeffding [32] is the inequality 

P{Sn ^ a;) s^ e-^'/2 (^ 2) 

for all X ^ 0. Obviously related to this is Khinchin's inequality — see e.g. sur- 
vey [55]; for other developments, including more recent ones, see e.g. [39, 44, 
54, 86]. Papers [60, 68] contain multidimensional analogues of an exact ver- 
sion of Khinchin's inequality, whereas [67] presents their extensions to multi- 
affine forms in ei , . . . , £„ (also known as Rademacher chaoses) with values 
in a vector space. Latala [43] gave bounds on moments and tails of Gaus- 
sian chaoses; Berry-Esseen-type bounds for general chaoses were recently ob- 
tained by Mossel, O'Donnell, and Oleszkiewicz [51]. For other kinds of improve- 
ments/generalizations of the inequality (1.2) see the recent paper [1] and bibli- 
ography there. 

While easy to state and prove, bound (1.2) is, as noted by Efron [23], "not 
sharp enough to be useful in practice". Exponential inequalities such as (1.2) 
are obtained by finding a suitable upper bound (say £{t)) on the exponential 
moments Ee* " and then minimizing the Markov bound e~*^£(t) on P(S'„ ^ x) 
in i ^ 0. The best exponential bound of this kind on the standard normal tail 
probability P(Z ^ x) is inft^o e~*^ E e*^ = e"^ /^, for any x ^ 0. Thus, a factor 
of the order of magnitude of - is "missing" in this bound, compared with the 
asymptotics P(Z ^ x) ^ - (p{x) as a; — )- oo; cf. the result by Talagrand [81]. Now 
it should be clear that any exponential upper bound on the tail probabilities 
for sums of independent random variables must be missing the - factor. The 
problem here is that the class of exponential moment functions is too small. 

Eaton [19] obtained the moment comparison E/(S'„) ^ Ef{Z) for a much 
richer class of moment functions /, which enabled him [20] to obtain an upper 
bound on P(S'„ ^ x), which is asymptotic to C3 P{Z ^ x) as x — >■ cxd, where 

C3 := 2|i ^ 4.4634 .... 

Eaton further conjectured that P(5„ ^ x) ^ C3- ip{x) for x > -\/2. The stronger 
form of this conjecture, 

P{Sn ^ x) !^ cP{Z ;^ x) (1.3) 

for all x e M with c = C3 was proved by Pinelis [60], along with a multidi- 
mensional extension, which generalized results of Eaton and Efron [21]. Various 
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generalizations and improvements of inequality (1.3) as well as related results 
were given by Pinelis [58, 61, 62, 65, 69, 71, 72, 74] and Bentkus [7-9]. 

Clearly, as pointed out e.g. in [10], the constant c in (1.3) cannot be less than 

P(75(ei +£2)^^/2) 
c, := ^' ^ p^ -=3.1786..., (1.4) 

which may be compared with C3. Bobkov, Gotze and Houdre (BGH) [12] gave a 
simple proof of (1.3) with a constant factor c « 12.01. Their method was based 
on the Chapman-Kolmogorov identity for the Markov chain (Sn)- Such an iden- 
tity was used, e.g., in [63] concerning a conjecture by Graversen and Peskir [25] 
on maxfc^n \Sk\- Pinelis [73] showed that a modification of the BGH method can 
be used to obtain inequality (1.3) with a constant factor c « 1.01 c*. Bentkus and 
Dzindzalieta [11] recently closed the gap by proving that c* is indeed the best 
possible constant factor c in (1.3); they used the Chapman-Kolmogorov iden- 
tity together with the Berry-Esseen bound and a new extension of the Markov 
inequality. Bentkus and Dzindzalieta [11] also obtained the inequality 



P{Sn^x)^\ + l{l^^/2-2/x^) ioYxe{l,V2], (1.5) 

whereas Holzman and Kleitman [34] proved that P(S'„ > 1) ^ j^- 

We should also like to mention another kind of result, due to Montgomery- 
Smith [50], who obtained an upper bound on lnP(S'„ ^ x) and a matching 
lower bound on In P{Sn ^ Cx) for some absolute constant C > 0; these bounds 
depend on x > and on the sequence (ai, . . . , a„) and differ from each other 
by no more than an absolute constant factor; the constants were improved by 
Hitczenko and Kwapien [29] . The result of [50] was inspired by upper and lower 
bounds on the L^-norm of sums of general independent zero- mean r. v. 's obtained 
by Latala [42] and was extended to such general sums in [31]. The proof in [50] 
was in part based on an extension of the improvement of Hoffmann- J0rgensen's 
inequality [33] found by Klass and Nowicki [36]. More recent developments in 
this direction are given in [37, 38]. 

In the mentioned paper [23], Efron conjectured that there exists an upper 
bound on the tail probability P(S'„ ^ x) which behaves as the corresponding 
standard normal tail P{Z ^ x), and he presented certain facts in favor of this 
conjecture. Efron's conjecture suggests that even the best possible constant 
factor c ^ c^, ~ 3.17... in (1.3) is excessive for large x; rather, for such x 
the ratio of a good bound on P(S'„ ^ x) to P(Z ^ x) should be close to 1. 
Theorem 1.1 below provides such a bound, of simple and explicit form. 

Another well-known conjecture, apparently due to Edelman [22, 78], is that 

P(^„ ^x)^ sup„^i P (_^ (e^ + . . . + e„) ^ a;) (1.6) 

for all X ^ 0; that is, the conjecture is that the supremum of P(S'„ > x) over all 
finite sequences (ai, . . . , a„) satisfying condition (1.1) is the same as that over 
all such (ai, . . . , a„) with equal a^'s. Certain parts of the proof of Theorem 1.1 
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may be seen as providing additional credence to this conjecture. On the other 
hand, if (1.6) were known to be true, it would to a certain extent simplify the 
proof of Theorem 1.1. Also, it is noted in [11] that (1.6), used together with the 
Berry-Esseen bound, would imply another known conjecture [4, 28, 34] - that 
P{Sn > 1) ^ J- Yet another interesting conjecture [14, 30, 53, 85] states that 

The main result of the present paper is 
Theorem 1.1. For all real x > 






P{Sn ^x)^ Q{x) := P{Z >x) + ^^ <P{Z>x){l + ^), (1.7) 



where 

C:=5V2^P(|Z| < 1) = 14.10.... (1.8) 

Remark 1.2. The constant factor C is the best possible in the sense that the 
first inequality in (1.7) turns into the equality when a: = n = 1. It would be 
desirable to find the optimal constant C if the constant 9 in the denominator in 
(1.7) is replaced by a smaller positive value, for then the bound Q{x) would be 
decreasing somewhat faster; however, such a quest appears to entail significant 
technical complications. 

Using e.g. part (II) of Proposition 3.1, it is easy to see that the ratio of the 
bound Q{x) in (1.7) to P{Z > x) increases from sa 2.25 to « 3.61 and then 
decreases to 1 as x increases from to « 2.46 to oo, respectively. Figure 1 
presents a graphical comparison of this ratio, Q{x)/ P{Z > x), with 

(i) the best possible constant factor c=:c*w3.18in(1.3); 
(ii) the level 1, which is asymptotic (as x — )• oo) to the ratio of either one 
of the two bounds in (1.7) to P{Z > x), and hence, by the central limit 
theorem, is also asymptotic to the ratio of the supremum of P{S„ ^ x) 
(over all normalized Rademacher sums Sn) to P{Z > x); 
(iii) the ratio of HocfFding's bound e^^ /^ to P{Z > x). 

In Figure 1, the graph of the latter ratio looks like a steep straight line (and 
asymptotically, for large x, is a straight line), most of which is outside the 
vertical range of the picture, thus showing how much the bounds c* P{Z ^ x) 
and Q{x) improve the Hoeffding bound e~^ Z^. 

In view of the main result of Bentkus [6], one immediately obtains the fol- 
lowing corollary of Theorem 1.1. 

Corollary 1.3. Let X,Xi, . . . ,X„ be i.i.d. r.v. 's such that P{\X\ ^ 1) = 1 and 
EX = 0. Then 

P (^l±-^=±^ ^ x) < 2Q„(x) 

for all real x ^ 0, where Qn is the linear interpolation of the restriction of the 
function Q to the set ^(| - [f J + Z). 
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Fig 1. Ratio Q{x)/ P{Z > x) (thick solid) compared with the ratio e ^ ' /P{Z > x) (solid, 
steeply upwards), as well as with the levels 1 (dashed) and c* fs 3.18 (dotted) 



Here we shall present just one more application of Theorem 1.1, to the self- 
normalized sums 

where, following Efron [23] , we assume that the Xi 's satisfy the so-called orthant 
symmetry condition: the joint distribution of siXi, . . . , s„X„ is the same for any 
choice of signs si,...,s„ € {1,-1}, so that, in particular, each Xi is symmet- 
rically distributed. It suffices that the X^'s be independent and symmetrically 
(but not necessarily identically) distributed. In particular, Vn ~ Sn if Xi — aiSi 
for all i. It was noted by Efron that (i) Student's statistic r„ is a monotonic func- 



tion of the so-called self-normalized sum: r„ = a/-^^ V^/-\/1 — Vj^/n and (ii) 
the orthant symmetry implies in general that the distribution of Vn is a mixture 
of the distributions of normalized Rademacher sums Sn- Thus, one obtains 

Corollary 1.4. Theorem 1.1 holds with Vn in place of Sn- 

Note that many of the most significant advances concerning self-normalized 
sums are rather recent; e.g., a necessary and sufficient condition for their asymp- 
totic normality was obtained only in 1997 by Gine, Gotze, and Mason [24]. 

It appears natural to compare the probability inequalities given in Theo- 
rem 1.1 with limit theorems for large deviation probabilities. Most of such the- 
orems, referred to as large deviation principles (LDP's), deal with logarithmic 
asymptotics, that is, asymptotics of the logarithm of small probabilities; see 
e.g. [16]. As far as the logarithmic asymptotics is concerned, the mentioned 
bounds c* P{Z ^ a;) and Q{x) and the Hoeffding bound e~^ 1"^ are all the same: 
In [c* P(Z ^ x)] ^ ln(3(a-) ^ Ine"^"/^ ~ —x^ /2 as a; — > oo; yet, as we have 
seen, at least the first two of these bounds are vastly different from the Ho- 
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eflding bound, especially from the perspective of statistical practice. Results 
on the so-called exact asymptotics for large deviations (that is, asymptotics for 
the small probabilities themselves, rather than for their logarithms) are much 
fewer; see e.g. [16, Theorem 3.7.4] and [56, Ch. VIII]. Note that the inequalities 
in (1.7) hold for all a; > 0, and, a priori, the summands aiSi do not have to 
be identically or nearly identically distributed; cf. conjecture (1.6). In contrast, 
almost all limit theorems for large deviations in the literature - whether with 
exact or logarithmic asymptotics - hold only for x ~ 0{^Jn), with n being the 
number of identically or quasi-identically distributed (usually independent or 
nearly independent) random summands; the few exceptions here include results 
of the papers [52, 64, 76, 77, 87] and references therein, where the restriction 
a; = 0{\/n) is not imposed and x is allowed to be arbitrarily large. In general, 
observe that a limit theorem is a statement on the existence of an inequality, not 
yet fully specified, as e.g. in "there exists some no such that |a;„ — x\ < e for all 
n ^ no" ; as such, a limit theorem cannot provide a specific bound. Of course, 
being less specific, limit theorems are applicable to objects of much greater 
variety and complexity, and limit theorems usually provide valuable initial in- 
sight. Yet, it seems natural to suppose that the tendency, say in the studies of 
large deviation probabilities, will be to proceed from logarithmic asymptotics 
to asymptotics of the probabilities themselves and then on to exact inequalities. 
We appear to be largely at the beginning of this process, still struggling even 
with such comparatively simple objects as the Rademacher sums - the simplicity 
of which is only comparative, as the discussion around Figure 1 in [73] suggests. 
However, there have already been a number of big strides made in this direction. 
For instance, Boucheron, Bousquet, Lugosi, and Massart [13] obtained explicit 
bounds on moments of general functions of independent r.v.'s; their approach 
was based on a generalization of Ledoux's entropy method [46, 47], using at that 
a generalized tensorization inequality due to Latala and Oleszkiewicz [45] . More 
recently, Tropp [83] provided noncommutative generalizations of the Bennett, 
Bernstein, Chernoff, and Hoeffding bounds - even with explicit and optimal 
constants; as pointed out in [83], "[a]symptotic theory is less relevant in prac- 
tice". Yet, as stated above, in the case of Rademacher sums and other related 
cases significantly more precise bounds can be obtained. 

2. Proof of Theorem 1.1: outline 

Let us begin the proof with several introductory remarks. 

In this section, a number of lemmas will be stated, from which Theorem 1.1 
will easily follow. Most of these lemmas will be proved in Section 3 - with the ex- 
ception of Lemmas 2.3 and 2.7, whose proofs are more complicated and will each 
be presented in a separate section. Each of these two more complicated lemmas 
is based on a number of sublemmas - which are stated in the corresponding 
section and used there to prove the lemma; each of these two sections is then 
completed by proving the sublemmas. This tree-like structure appears suitable 
for presentation: first the general scheme of the proof and then gradually down 
to the finer details. 
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There are many symbols used in the proof. Therefore, let us assume a lo- 
calization principle for notations: any notations introduced in a section or in a 
proof of a lemma/sublemma supersede those introduced in preceding sections 
or proofs. For example, the meaning of the X^'s introduced later in this section 
differs from that in Section 1. 

Without loss of generality (w.l.o.g.), assume that 

^ fli ^ . . . ^ a„ =: a, (2-1) 

so that a = max^ at . Introduce the numbers 

whence for all x ^ 

^ ui ^ . . . Sj Wn = xa. 

The proof of Theorem 1.1 is to large extent based on a careful analysis of the 
Esscher tilt transform of the r.v. 5„. In introducing and using this transform, 
Esscher and then Cramer were motivated by applications in actuarial science. 
Closely related to the Esscher transform is the saddle-point approximation; for 
a recent development in this area, see [57]. The Esscher tilt has been used 
extensively in limit theorems for large deviation probabilities, but much less 
commonly concerning explicit probability inequalities - two rather different in 
character cases of the latter kind are represented by Raic [79] and Pinelis and 
Molzon [75]. One may also note that, in deriving LDP's, the tilt is usually 
employed to get a lower bound on the probability; in contrast, in this paper the 
tilt is used to obtain the upper bound. 

The main idea of the proof is to reduce the problem from that on the vector 
(oi, . . . , a„) of an unbounded dimension n to a set of low-dimensional extremal 
problems, involving sums of the form ^^ g{ui). The first step here is to represent 
x^ J gdv, where g{u) := g{u)/u^ (for u ^ 0), 



iE"''^"- (2-2) 



and St denotes the Dirac probability measure at point t, so that i/ is a probability 
measure on the interval [0,a;a]. This step turns the initial finite-dimensional 
problem into an infinite-dimensional one, involving the measure v. However, 
then the well-known Caratheodory principle allows one to reduce the dimension 
to (at most) k—1, where k is the total number of the integrals (with the respect to 
the measure v) involved in the extremal problem in hand. Moreover, it turns out 
that the systems of integrands one has to deal with in the proof of Theorem 1.1 
enjoy the so-called Tchebycheff and, even, Markov properties; therefore, one 
can reduce the dimension even further, to about fc/2, which allows for effective 
analyses. It should also be noted that the verification of the Markov property of 
a finite sequence of functions largely reduces to checking the positivity of several 
functions of only one variable. Major expositions of the theory of Tchebycheff- 
Markov systems and its applications are given in the monographs by Karlin and 
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Studdcn [35] and Krcm and Nudcl'man [40]; closely related to this theory are 
certain results in real algebraic geometry, whereby polynomials are "certified" 
to be positive on a semialgebraic domain by means of an explicit representation, 
say in terms of sums of squares of polynomials; see e.g. [41, 49]. A brief review 
of the Tchebycheff and Markov systems of functions, which contains all the 
definitions and facts necessary for the applications in the present paper, is given 
in [59]. 

Even after the just described reductions in dimension, the proof of Theo- 
rem 1.1 entails extensive calculations, both symbolic and numeric, which we did 
using Mathematica; other advanced calculators should be able to do the job. A 
well-known result by Tarski [15, 48, 82] — which can be viewed as a far-reaching 
development of Sturm's theorem on the real roots of a polynomial — implies 
that systems of algebraic equations/inequalities can be solved in a completely 
algorithmic manner. Similar results hold for algebraic-hyperbolic polynomials 
(that is, polynomials in x,e^,e~^) — as well as for certain other expressions 
involving inverse-trigonometric and inverse-hyperbolic functions (including the 
logarithmic function), whose derivatives are algebraic. However, it was only a 
few years ago that Tarski's algorithm and its further developments were im- 
plemented into widely used computer software. In Mathematica, this is done 
via Reduce and other related commands, such as Maximize and Minimize. In 
particular, command 

Reduce [condl && cond2 && ..., {varl , var2, . . . ,} , Reals] 

returns a simplified form of the given system (of equations and/or inequalities) 
condl, cond2, . . . over real variables varl, var2, .... However, the execution 
of such a command may take a very long time (or require too much computer 
memory) if the given system is more than a little complicated; in such cases, 
Mathematica can use some human help. As for the commands Maximize and 
Minimize, whenever possible they return the exact global maximum/minimum 
subject to the given restrictions; otherwise, these commands return a statement 
implying that Mathematica cannot do the requested exact optimization. Alter- 
natively, such calculations, say for piecewise smooth functions of a finite number 
of variables, can be done, also quite rigorously, using interval arithmetics; see 
e.g. [27]; again, the only limitation here is the computer power. It should be 
quite clear that all such calculations done with an aid of a computer are no 
less reliable or rigorous than similar, or even less involved, calculations done by 
hand. 



For alH = 1, . . . ,n, let 

Xi := aiEi. 

Next, let Xi, . . . , Xn be any r.v.'s such that 

E.g(Xi,...,X„) = ^-^5- (2.3) 
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for all Borel-measurable functions g : M" — > M. Equivalently, one may require 
condition (2.3) only for Borel-measurable indicator functions g; clearly, such 
r.v.'s Xi do exist. It is also clear that the r.v.'s Xi are independent. Moreover, 
for each i the distribution of Xi is (e"'5ai + e^"'(5_oi)/(e"' + e""'). 

Formula (2.3) presents the mentioned Esscher tilt transform, with the tilting 
parameter (TP) the same as the x in (1.7); that is, we choose the TP to be the 
minimizer of e~*^ E e*^ = e~*^^* " in f ^ — rather than the minimizer of 
g-tan £ ^ts„ ^ vifhich latter is usually taken as the TP in limit theorems for large 
deviations and can thus be expressed only via an implicit function. Our choice 
of the TP appears to simplify the proof greatly. 

In terms of the tilted r.v.'s Xi, . . . , X„, introduce now 



m,:=^EX, = i^^,thu„ ,,:= /^VarX, = ij5]-^, (2.4) 
L,:^^yE\X,~EX^\\ (2.5) 

where ch := ch, sh := sh, th := th, and arcch :~ arccosh (assuming that 
arcchz ^ for all z £ [l,oo); thus, for each z g [l,oo), arcchz is the unique 
solution y ^ to the equation chy = z). Let F„ and <I> denote, respectively, the 
tail function of Xi + • • • + Xn and the standard normal tail function, so that 

Fniz) = P(li + ---+Xn> z) and ¥(z) = P{Z ^ z) 

for all real z. Also, let cbe denote the least possible constant in the Berry-Esseen 

inequality 

(z — Tn \ 
-) ^CBE^x; (2.6) 
.o^^» Sx ' 

by Shcvtsova [80], 

^ 56 . 
Cbe ^ Too. 

a slightly worse bound, cbe ^ 0.5606, is due to Tyurin [84]. 
Lemma 2.1. For all a; ^ 

P(^„ ^x)^ N{x) + 2cbe5(x), (2.7) 

where 

T" S / T" rfl \ 1 

2jlnchu^ H -^ - xnix +ln$f ^^ -\- xsx) k (2.8) 

i 

5(a;) -L^cxpj -x^ + ^lnchujj. (2.9) 

i 

Next, introduce the ratio 

x$(x) 
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which is the inverse Mills ratio at x divided by x. By [66, Proposition 1.2], r is 
strictly and continuously decreasing from oo to 1 on the interval (0, oo), so that 
there is a unique root X3/2 G (0, 00) of the equation 



at that, 
and 

Introduce also 
and 



^2^3/2) = 3/2; 
X3/2 = 1.03... 
1 < r{x) ^ I for X ^ 2:3/2. (2.11) 



^ = 0-408 



hix) := ^^ (2.12) 



Ctfix) 
9 + x 
(cf. (1.7)). Now one can state an upper bound on the term N{x) in (2.7) 

Lemma 2.2. If x ^ 0:3/2 then N{x) < $(a:). 

Lemma 2.3. If x ^ ^ and w„ ^ u», then 2cb-eB{x) ^ h{x). 

Lemma 2.4. For all a: > 0, h{x) < ^^■ 

Lemma 2.5. For all x > 0, P(ei ^ x) ^ ¥(a:) + h{x). 

Introduce 

X — a T. -\- n. 

U := Ux.a := , and V := V^n 



with a as in (2.1). 

Lemma 2.6. If x ^ Vs , then i$([/) + ^'^{V) ^ $(a:). 

Lemma 2.6 was proved in [12]; cf. also [73, Lemma 5]. 

Lemma 2.7. If x ^ ^ and m„ ^ u^, then \h{U) + \h{V) ^ h{x). 

Lemma 2.8. For all x e (0, ^3] 

P(S'„ ^x) s; $(a:) + /i(x). (2.13) 

Proof of Theorem 1.1. By definition (2.12) and Lemma 2.4, it is enough to prove 
inequality (2.13) for all a: > 0. This can be done by induction on n. Indeed, for 
n = 1 this is Lemma 2.5. Assume now that n ^ 2. In view of Lemma 2.8, it is 
enough to prove inequality (2.13) for all x > V3 . At that, in view of Lemmas 2.1, 
2.2, and 2.3, it is enough to consider the case m„ > u*. To do that, write 

P(5„ ^x)^\ P(^„_i ^U) + \ P(5'„_i ^ V), 

where 5'„_i := biEi + • • • + 6„_ie„_i, with bi := Oi/y/l — a?- . It remains to use 
the induction hypothesis together with Lemmas 2.6 and 2.7. D 
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3. Proofs of Lemmas 2.1, 2.2, 2.4, 2.5, and 2.8 

Proof of Lemma 2.1. Reading equation (2.3) with .g(Xi, . . . , X„) = e"^"^" l{Sn ^ 
x} right-to-left, recahing (2.6), and observing that Ee^"^" = Hi'^hui, one has 



P{Sn > X) 

where 



e-"^dF„(y)= / xe--''(i^„(x)-i^„(y))dy^7Vi(:E)+Bi(x), 

[a:,oo) 



Ni{x) :-- 



— / a; — 771, 



f 



— (11 — rrix 



dy 



and 



Bi(.t) :=2cBEi. 



y - mx\ dy _ N{x) 
So: / S^; Ee^^" 



\ S-r. / 



'<ly = 2cBE.L^e ^ 



Ee^S" 



Thus, (2.7) foUows. 



D 



Now and later in the paper, we need the following special I'Hospital-type rule 
for monotonicity. 

Proposition 3.1. ([70, Propositions 4.1 and 4.3]) Let — cxj ^ a < 6 ^ oo. Let 
f and g be differ entiable functions defined on the interval {a,b). It is assumed 
that g and g' do not take on the zero value and do not change their respective 
signs on (a, b). 

(I) If f{a+) = g{a+) = or f{b-) = g{b-) = 0, and if the ratio f'/g' 
is strictly increasing/decreasing on {a,b), then (respectively) (f/g)' is 
strictly positive/negative and hence the ratio f / g is strictly increasing /de- 
creasing on (a, b). 
(II) If fia+) = g{a+) = and if the ratio f'/g' switches its monotonicity 
pattern at most once on (a, b) — only from increase to decrease, then 
the ratio f/g does so. Similar statements, under the condition f{b—) = 
g{b—) — and/or for a switch from decrease to increase, are true as well. 

Proof of Lemma 2.2. Let us begin this proof by using the well-known fact that 
the tail function $ is log-concave. This fact is contained e.g. in [26, 62]. Al- 
ternatively, it can be easily obtained using part (I) of Proposition 3.1, since 
(ln<f>)' = — £. So, one can write 

ln$(j/) ^ ln$(a;) + (ln¥)'(a;)(y ~ x) = ln¥(a;) - xr{x){y - a;), 

^^^ -t- xs^ (cf. (2.8)) and r{x) defined by (2.10). Therefore and in 



with y = ^^ 
view of (2.4), 



4ln = 



N{x) 



$(a;) 



^8{r,v) := / e{u)+r- 1- 



f{u) 



v{(lu) 
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:= 


IncliM 

U2 


1 






2ch2 


u 
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tllU , J./ N -, thu 

and f(u):=l -— , ^^ 



u u ch u 

for u ^ 0, e(0) := and /(O) := 1, and r := 7-(a:;). Note that the probability 
measure i/ on the interval [0, xa] defined by (2.2) satisfies the restriction 

bdv = s^, where b{u) := — g — ■ (3-1) 

ch u 

Recalling now (2.11), we see that, to prove Lemma 2.2, we only need to show 
that £{r,v) ^ for all such probability measures v and all r <E [1, |]; in fact, 
since £{r,v) is affine in r, it suffices to consider only r € {1, |}. 

Using [59; Proposition 1] and the Mathematica command Reduce, one can 
check that each of the two systems (1, —h, / — e) and (1, —6, /) is an M+-system 
on [0,oo); for the system (1,— fo, / — e), it takes about 20 sec on a standard 
laptop, and about 1 sec for the system (1,-6,/). Since s^ E (0,1] and r ^ 1, 
the integrand in the integral expression of £{r,i') can be rewritten as g := 
r — I (/ — 9e) with 6 := ^ G (0, 1], and so, {l,—b,—g) is an M+-system on 
[0, oo), for any r ^ 1 and any value of s^. Hence, by [59, Proposition 2], the 
minimum of J{—g) dv, and thus the maximum oi £{r, v), over all the probability 
measures v on [0,a;a] satisfying the restriction Jbdu = s^ is attained when 
the support of i/ is a singleton subset (say {u}) of [0,a;a]. For this u, one has 
Sx = 1/ chu, and it now suffices to show that g{u) ~ e{u) + r- (l — /(u) chu) ^0 
for r e {1, |} and u £ [0,oo); using again the Mathematica command Reduce, 
it takes about 2 sec to check this in each of the two cases, r = 1 and ?" = §• □ 

Proof of Lemma 2.4- Using part (I) of Proposition 3.1, one can see that the 
ratio — , , is increasing in a; > 0, from to C. Now the result follows. D 

<I>(a:) ° 

Proof of Lemma 2.5. Observe that the definition (1.8) of C is equivalent to 
the condition $(1) + h{l) = i (cf. Remark 1.2). Hence and because $ + /i is 
decreasing on (0, oo), one has P(ei ^ x) = ^ = $(1) + h{l) ^ $(a;) + h{x) for 
ah X e (0, 1]. For x > 1, one obviously has P(£i ^ a;) = < ¥(a;) + h{x). D 

Proof of Lemma 2.8. By the symmetry, Chebyshev's inequality, and the main 
result of [73], 

P(5„ ^ a;) < i I{0 < x s; 1} + 2^ I{1< .T < i§} + 3.22¥(x) I{i| < a; s^ ^3 } 

for all X G (0,-\/3]- In particular, for all x e (0,1] one has P(S'„ ^ a;) ^ ^ = 
P(ei ^ a;) ^ $(a;) + h{x), by Lemma 2.5. 

Next, let us prove (2.13) for x G (1, y|]. As in the proof of [73, Lemma 3], 
one can see that the minimum of x^^{x) over x G [1, i§] is attained at one 
of the end points of the interval [1, yS]; in fact, the minimum is at a; = 1. It 
is also easy to see that the minimum of x^h{x) over x G [1, -jl] is attained 
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at X = 1 as well. Thus, P(5„ ^ x) s^ ^ ^ 7=-^^ t(^{x) + h(x)) ^ 

^^ 2x^[<!>{x) + h{x)j ^ ' ' ^ '' 



($(x) + h{x)) = $(a;) + h{x) for a; e (1, 



131 

2(_$(l) + h(l)j V ^ ' - '/ V . ^ . > - lOJ 

The case a; S (i§:V3] is similar to the just considered case x € (lii§]- 
Here, using part (II) of Proposition 3.1 (cf. [73]), one can see that /i/$ switches, 
just once, from increase to decrease on (0,oo); in particular, /i/$ increases on 
(y§i VS]- So, to complete the proof of Lemma 2.8, it is enough to check that 
3.22¥(i§) < ¥(i§) + /i(i§), which is true. D 

4. Proof of Lemma 2.3 

We begin with a technical sublemma, used in the proof of Sublemma 4.2: 
Sublemma 4.1. For each a e [0, 1], the function 

(0, l]3v^ ha{v) := arcch(^)(2 - v){v - ^) (4.1) 

is concave. 

We shall need the following tight upper bound on the Lyapunov ratio L^, 
defined by (2.5): 

Sublemma 4.2. One has 

i^- ^ -5- Vw?(l + th^Uj)ch'u,. (4.2) 

i 

By Sublemma 4.2 and the definition (2.9) of B{x), 

B{x) s; ^e-^"+\ (4.3) 



J := J{x, v) := x"^ i diy + In k dv, 



where 



fc(u) := u(l + th^ u) ch u, £{u) := isf^ for m 7^ 0, 

i{0) :— 2 J a-nd i' is the probability measure on the interval [0,w,] defined by 
(2.2), so that 1/ satisfies the restriction (3.1). Noting that (Inch)" = th' = scch 
and applying (twice) the special IHospital-type rule for monotonicity given by 
part (I) of Proposition 3.1, one sees that 

£'<0 on (0,00). (4.4) 

To obtain the upper bound h{x) on 2cBE^(a;) as stated in Lemma 2.3, we shall 
maximize J{x, v) over all such probability measures v. To do so, we shall max- 
imize j kdv given values of the integrals /ld!^(= 1), jhdv{^ s^, as in (3.1)), 
and j (.dv. 
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Sublemma 4.3. The sequence (<?o,ffi,ff2,33) '■— {^,^b,—£,k) is an Mj^- system 
on [OjU*]. 

So, by [59, Proposition 2] (with n — 2 and m = 1 there), it suffices to consider 
z/'s of the form v — (\ — t)5u + tSu, for some t e [0, 1] and u G [0, m*]. For such 

J{x,v) = J{t,u) ■= Jx{t,u) := x^-{il-t)£{u)+U{u^))+ln{{l-t)k{u)+tk{u^)). 

Thus, we need to maximize J{t,u) over all {t,u) G [0,1] x [0,u*]; clearly, this 
maximum is attained. For all {t, u) £ (0, 1) x [0, u,), 

/dJ\ (1 - t)k{u) + tk{u^) _ (fcK) - k{u)) + T {£{u^) - i{u)) 
\ dt J u^ — u Ui, — u 

= k'{w)+T£'{w), (4.5) 

\auJ 1 — t 

where r := x^- [{l — t)k{u)+tk{u^)) and w is some number such that u < w < u^ 
(whose existence follows by the mean- value theeorem). So, if the maximum of 
J over the set [0,1] x [0,w*] is attained at some point (t, m) G (0,1) x (0,w*), 
then at this point one has ^ = = ^, whence, by (4.5), (4.6), and (4.4), 

„, ,1 = — r = ',, while m* > w > u ^ 0, which contradicts 

Sublemma 4.4. The function p :^ -p- is strictly increasing on the interval 
[0, It*] (hy continuity, we let p{{)) := /o(0+) = — ooj. 

Also, no maximum of J is attained at any point (i,w) G (0,1) x {0}, because 
at any such point the right-hand side of (4.6) is A;'(0) -|- t£'{Q) = 1 + r • > 0, 
whereas the left-hand side of (4.6) must be ^ 0. Thus, the maximum can be 
attained at some point (i, u) G [0, 1] x [0, u*] only if either t G {0, 1} or w = m*. 
Therefore the maximizing measure v must be concentrated at one point, say u, 
of the interval [0,m,]. Together with (4.3), this shows that 



B{x) < sup ie 

ue[o,u,] 



-x^ + Jo 



where 

Jo '■= Jo{x, u) := Jx{0,u) = X ■ £{u) + h\k{u). 

So, Lemma 2.3 reduces now to the following statement: 

a;2 (?) 

A(x,m) := Jo{x,u) - — -\\-ix + \\-i{% + x^)-K sC (4.7) 

for all {x,u) G [y|,oo) x [O,?/*], where 

i^:=ln. ^ 



2\/27rcBE 
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Thus, one may want to maximize A in m G [0, u*]. Towards that end, observe 
that for all u > 

1 aA _ 2 



-t{u) du 
where 



7(ii) 



___fc^_ 1 

so, the partial derivative of A in w > equals 7(u)— a;^ in sign. On the other hand, 
the function -^ is positive and strictly decreasing and, in view of Sublemma 4.4, 
the function (— p) is so as well (on the interval [0, u*]). It follows that the function 
7 too is positive and strictly decreasing on (0,w,]; at that, 7(0+) = 00. 
Introduce now 

x, := v^tK) = 7.39 . . . . (4.8) 

By the mentioned properties of the function 7, for each x G (0,a;*] one has 
7(u) J^ x^ for all u e [0, u,] and hence A(a;, u) increases in u e [0,Uh.], so that 
K(x,u) ^ A(a;, u*) for all u £ [0,u*]. Since the derivative of A(a;,u*) in a; is a 
rather simple rational function, it is easy to see that A(x, w*) ^ for all x ^ yS. 
So, inequality (4.7) holds for all {x,u) g [Y§,a;*] x [0, it,]. 

It remains to prove (4.7) for each x S [a;*,(X)) (and aU u G [0,^*]). For each 
such X, there is a unique Ux € [0,m*] such that ^{u) — x^ and hence ^ are 
opposite to u — Uj. in sign, and so, A(x, u) ^ A(a;, u^.) for all u G [0, w,]. 

Since, by (4.4), the function i is strictly and continuously decreasing on 
[0, CJo), there is a unique inverse function £~^ : (0, i] i-^ [0, 00). Now introduce 

Jo{x,X)-^ Ja{x,r\X)) =x^X + \nk{X), where I -.^ k o T^ 

and A G [^(uh.),£(0)] = [^(■u,),i]. Next, observe that (Infc)' = -70^-1, which 
is decreasing on [£(u,), ^], because 7 and ^ (and hence £~^) are decreasing. It 
follows that the function In A: is concave on [^(w,), 5], and so, Jo(a;, A) is concave 
in A G [£(u,), i] - for each real x. At this point, we need 

Sublemma 4.5. i^or all u G (0, it*], 

7(«) > ^. 

By (4.8) and Sublemma 4.5, if u = -^ and x ^ x,, then u G (0,?i,] and 
7(^) > a;2 = 7(w:r), which in turn implies that ^ < u^, £(^) > (.{u^), 

and (lnfc)'(£(^)) < (Infc)'(^(it^)) = -7(u.O = -a;^ (since 7, ^, and (In A:)' 
are decreasing); so, for aU A G [^(u*),^], %:{x,i{^)) < ^{xJiu^)) = 0; 
therefore and by the concavity of Jo{x, A) in A, 

j,ix,x) ^ jo{x,ii^)) + ^{x,ii^)) {X- e{^)) ^ Mx,^) 

for all A G [^(it*), |], where 

Joix,u) := Jo(x,m) + (x^ - 7(u)) (^(u*) - ^(u))- 
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Thus, in view of (4.7), Lemma 2.3 reduces to the inequahty Joix, ^) — ^ — 
\nx + hi(9 + .T^) — A' ^ for all x ^ x^,, where we change the variable once 
again, from x to u, by the formula a; = -^^. So, Lemma 2.3 reduces to 

Sublemma 4.6. For all u G (0, u*] 

K{u):=U^,u)~^-\n^+\n{'d + ^)^K. 

It remains, in this section, to prove Sublcmmas 4.1-4.6. 

Proof of Sublemma ^.l. Since ha{v) is afhne in a, w.l.o.g. a G {0, 1}. Consider 

first the case a = 0. Observe that '^^''"'j*''^^ • A{t^ - 1)^/2 := (3 - 2t^){U^ - 1), 

which switches the sign from + to — at f = i/l as f increases from 1 to oo. 

Hence, the maximum of hQ{v) in u G (0,1] is attained at u = |, and this 
maximum is easily seen to be negative, which proves the case a = 0. 
The case a = 1 is considered similarly. Observe that 



diV 



4h'(it-^) 
8 + t3 + 6^5 , 



)](i- 1)3/2(1 + f)5/2(8 + i3 + 6i5) = 



is a certain polynomial in t (of degree 13), which switches the sign from + to — 
at a certain algebraic number ti as t increases from 1 to oo. Hence, the maximum 
of g a!) I gf.'i in t ^ 1 is attained at t = ti, and h'l{t^'^) can seen to be negative 
(using e.g. the Reduce command again), which proves the case a = 1 as well. 
Sublemma 4.1 is now completely proved. D 

Proof of Sublemma 4--2. Observe that L^ = {xsx)~^ X); ■"?(! ~ th Ui). So, in- 
equality (4.2) means exactly that 



E 



ul{l - ih^ Ui) ~ sl'^iclil + ih^ Ui) cYlu, = ^ulg[ui) < (4.9) 



for all UiS in the interval [0,Uh.] such that X^i ^f ^ ^^ ^'^"^ Tli i"' = a^^s^, 
where 

n{u) := u(l - th^ u) - s\u{\ + th^ u) chw = m( 2 ,— ) ( — = si! ) chu. 

^ ^ ^ ^ X V I V ch^ii/vch^u ^y 

Next, the object J^i'^fdi'^^i) i'^ (4-9) with the restrictions X^i ""? ~ ^^ ^^'^ 

2 

Xj ct^i\. = 2;^s^ can be rewritten as x"^ Eh{Y) given EF = s^, where /i(-) := 
/Iq(-) (as in (4.1)) with a = si^. and F is a r.v. with the distribution v := 
■^ J2i ^f<5«i J with Vi := ^^'2^. ; note that one always has s^: G (0, 1] and i/ is indeed 
a probability measure due to the restriction Xi "f = 2^2. So, by Sublemma 4.1 
and Jensen's inequality, a;~^ Xi "?5("i) = E/i(l^) ^ /i(Ey) = h{s'^) = 0, which 
proves the inequality in (4.9) and hence that in (4.2). D 
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Proof of Sublemma 4.3. By [59, Proposition 1], it is enough to show that the 
Wronskian Wj is strictly positive on the interval (0, u*) for each j = 0, . . . , 3. It is 
obvious that Wl^{u) = 1 > and Wq (u) = 2thM/ ch^ m > for ah m > 0. Next, 
using the command Reduce, one obtains (in about 3 sec) that j Wq{u) u ch u = 
2uchwsh^ u — u^sb^ u — (u + 3 chu shu ~ 2ush^ w) chu Inchu > for all m > 0. 
It remains to prove the positivity (for u £ (0,w,)) of 

Wi^iu) := 16u^ ch^ u Wi^{u) = Pa{u) + Pi{u) Inchw, where (4.10) 

Po(u) ;= uilQlu - 5m^ - 4(97^^ + 63) sh(2w) + 2{Alu^ - 72) sh(4u) 

+ 12(5^2 + 3) sh(6w) - (13^2 - 36) sh(8M) + 2(9^^ + 17)w ch(2ii) 

- 24(it2 + 4)^ ch(4'u) + 2(714^ + 31)u ch(6M) - (314^ - 25)u ch(8u)] , 

Pi{u) := 2 ch(it) ( - 250m^ sh(M) - 98'u^ sh(3M) - 34u^ sh(5M) + 6^^ sh(7'u) 

- 161ii2 ch(5M) + 23^2 ch(7w) + (321^^ + 432) ch(ii) + (96 - 471^^) ch(3w) 

+ 15ush(w) - 147wsh(3w) - 195ush(5w) - 33ush(7M) - 96ch(5u) - 48ch(7M) 

Next, one can bracket the function In ch between two of its Pade approximants 
near (for a definition, see e.g. [2]): 

"^ V =: '"2,3 (it) < In chit < 7-4,2 (u) := m^+iu^) 



for all u g (0,00); these inequalities can be verified by hand or (in about 4 sec) 
using the command Reduce. So, by (4.10), it is enough to show that 

W2,z{u) := ^(Po(u) + Pi{u)r2,:i{u)) and 

M^4,2(w) := ^^i5±^(Fo(w) + Pi{u)r^Au)) 

are positive for all u € (0,u*). Note that VF2,3(u) and W4,2(u) are each an 
algebraic-hyperbolic polynomial, of the form 

4 
^ (p,(u2)uch(2jM) + q,{u^) sh(2iM)), (4.11) 

where Pj{u^) and qj{u^) are polynomials in u^ (of degree 2, except that the 
Qj's for W4,2 are of degree 3). Thus, the positivity of W^2,3 and W4,2 can be 
checked using the command Reduce. It takes about an hour this way to check 
that W4,2 > on (0, 00) and about 40 min to check that W^2,3 > on (0,-^**), 
where u** = 4.98 . . . > m,, so that indeed H^2,3 > and ^4,2 > on (0, w*). 

To avoid these long execution times, one can, alternatively, do as follows. Let, 
for a moment, W stand for cither W2,3 or W4,2- As mentioned above, W can be 
represented in the form (4.11). Now expand ch(2JM) and sh(2JM) into Maclaurin 
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series, and write 

oo 

W{u) = y^ Wm{u), where 

rn—O 



with (2ju)2™ := 1 if j = m = 0. Note that for m ^ 1 



3 



(2m)! 

k=0 



W„,{u) := Wr,^{u) ^^'^^Z-l = E^™^*""'^' 



where each coefBcient Cm,k is of the form X)7=i('^fe + bk'm)j'^"^ for some real 
numbers Uk and bk, depending only on k e {0, 1, 2, 3}. 

For W = W2^3, using the command Reduce (or manually), one can quickly 
verify that for all to ^ 4 inequalities c,„^i < 0, Cm.2 > 0, and c„i,3 < hold, which 
implies that for all u £ (0,m*) one has Wm{u) > Cmfi + Cm^iul + Cm.su^ > 0, 
the latter inequality quickly checked (say) by another Reduce. Thus, for all 
u € (0,M*) one has Wmiu) > for all tti ^ 4 and hence X)m=6^™('") ^ *-*• 
On the other hand, X]m=o^™(^*) ^^ ^ polynomial in u, which can be quickly 
checked by yet another Reduce to be positive for all u G (0,u*). This proves 
that W2,3 > on (0,u*). 

The case W = W4_2 is considered similarly. Here one can check that Cm.i < 
and Cm, 3 > for to, ^ 5 (whereas Cm,2 > changes in sign from + to — near m = 
24.1). So, Wmiu) > min(cm,Q + Cm,iul+Cm,2ut,Cm,o+Cr,i,iul+Cr,i,20'^) > for aU 
m ^ 5 and u e (0, w*), by two more applications of Reduce (say). On the other 
hand, in this case the polynomial X]m=o^™(^) (rather than X]m=o ^™('")) ^^ 
positive for all u G {Q,u^,). This proves that W4_2 > on (0,u*). D 

Proof of Sublemma 4-4- The main part of this proof, concerning the negativity 
of the function DDpi defined below, can be done similarly to the alternative 
proof of the positivity of 1^2,3 in W4.2 in Sublemma 4.3. Introduce 

fci 
ki{u) := k'{u)u^, £i(m) := i'{u)u^, so that P = -;-; 



ti 



k' 
let also pi := -7-. 



Further, for u > introduce 



32 ch^ u 

{Dpi){u):^ p\{u) (w-ish2u)^ and 



iDDp,)iu)-'^''P^^^-^^' "' 



2ch 



u 



= E {Pj{u^)uch{2ju) + qj{u^) sh(2jw)) == ^ ^™(")' 

j=0 m=0 
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where 



Pj{u^) and qj{u^) are polynomials in u^ (of degree 2), and each coefficient Cm,k 
is of the form "^j^iidk + 6fem)j^™ for some real numbers afc and hk, depending 
only on fc G {0,1,2}. Using the command Reduce, one can verify that for all 
m ^ 9 and k S {0, 1,2} inequalities Cm.k < hold, which implies that for all 
M > one has rm{u) < and hence X]m=9'"m('^) > O? the verification of the 
inequality Cm,2 < for m ^ 9 takes about 18 sec, while that of each of the 
inequalities c,„_o < and c,n,i < for m ^ 9, just a fraction of a second. 
On the other hand, X]m=o''™('") i^ ^ polynomial in u, which can be quickly 
checked by yet another Reduce to be negative for all u G (0, u*). This proves 
that DDpi < on (0, u*). It follows that ^ ^3 is decreasing on (0, u*), from 
lim^^o ^^iiT^ = 64 > to ^^P^J^'^-') < 0. So, the monotonicity pattern of 
pi changes exactly once on (0, u*), from increase to decrease. Note also that 
/ci(0) = ^i(O) — 0. So, by part (II) of Proposition 3.1, the monotonicity pattern 
of p can change at most once on (0, u*), and only from increase to decrease. 
However, p'{u^) = 0.017... > 0. Thus, p is strictly increasing on (0, u^) and 
hence, by the continuity, on [0,w*]. D 

Proof of Sublemma 4-5. Observe that 

u^ sech(2w) [chu + ch(3u) + u(3shii + sh(3u))] 
4 ch u In ch M — 2u sh u 

The denominator of this ratio is positive, since, as previously noted, 7 > 0. Also, 
because the function ch is convex, one has chu + ch(3u) + u(3 shu + sh(3u)) ^ 
chu + ch(3u) ^ 2 ch(2u). It follows that 



7(w) ^ 7(w) := 



^2 



2 ch M In ch u — u sh M 

for all G (0,u*]. It remains to use the Reduce command to see that 7(14) > -§3- 
for all e (0,u*]. D 

Proof of Sublemma 4-6- First, observe that 



A = Ti + T2 + T3T4, 



where 



2 In ch u u r~ 

Ti{u) :=3 -, , T2{u) :=ln(./|(2 + 3M2)ch(2M)sechM), 

yA \ V - ' 
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where 
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rr, fi 

J4 = — , 
54 



F4i{u) := — 241nchu + 6wsechu sech(2u)(sh(3u) — shu) 

+ u"'sech(2M)(l + sechw ch(3u)), 
^42 (w) :—u^scchu sech(2u)(3sh?i + sh(3u)), 

2 In ch u — u th u 
9i(u) ■■= 4 . 

Further, the derivatives F^^ (0) are for all j = 0, . . . , 5, whereas 

F^l'' (u) = 24 sech"^ u [80 ch u - 50 ch(3u) + 2 ch(5u) 

- u (302shii - 57sh(3w) + sh(5u))] . 

Using now the Mathematica Maximize and Minimize commands, one finds that 
-203 < F^f ^ 768 and hence |F]f | ^^ 768 on the interval [0,w,], which yields 
|^4i('")| =^ ^r^^ '^'^^ ^^^ ^ ^ [0,u*]. Using Maximize and Minimize again (on 
the ratio '^^e , subject to the restriction < u ^ u*), one finds that | ^^e' \ ^ 
lim^j^o *^6 = 6 for all u e (0,w*]. So, for all u e [0, u*] one has |F4(w)| ^ 
|i^4i(M)| + \F42{u)\ ==; (^ + 6)u^ whence 

l/4Ki(Tr+6)<4 

on the interval (0,w*]. On the other hand, once again using Maximize and 
Minimize, one sees that \^\ < 0.08 and hence |T3r4| = \U\\^\ ^ 4 x 0.08 = 
0.32. Applying the command Maximize twice more yields Ti < —0.47 and 
T2-K < 0, so that indeed A-/v = Ti + {T3T4) + {T2-K) < -0.47+0.32 + < 
on (0,it*]. Sublemma 4.6 is now completely proved. D 



5. Proof of Lemma 2.7 

This proof could be simplified using the mentioned result (1.5); however, we 
decided to present an independent proof here, which is not much more compli- 
cated. Let 

A ■- A{x, u) ~ ^ [lh{U{x, u/x)) + ^h{y{x, u/x)) - h{x)\ . 

We have to show that A ^ for all pairs {x, u) in the set 

P := {{x, u) e [y§, 00) X [u^,(X)) : u < x}, 
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the condition u < x corresponding to the condition a = a„ < 1. Introduce 

A •= A (2: u) ■= — ex r {u-x^Y 1 {x'-u'')p2{x,uf 

du \2{x'^ — V?) ) (u—\)x'^ {x"^ — u)pi{x,u)' 

A / X 5Ai r 2ua;2 -, 
A2 := A2(x, u) := — — exp <^ -^ ^ r 

(u — 1)^(2; — uY{u + x)^ (x^ — v) Pi{x, u)^p3{x, u)^ 



P2{X,U) 



where 



Pi{x,u) ■.^x'^{ll + x^)- {\Qu^ + 2ux^), 
P2{x,u)-~x^{Q + x^)-{%u^ + 2ux^), ) (5.1) 

P3.{x, u) := ^2(9 + x^) - (8^2 - 2ux'^) 
Consider also the set 

P := {{x,u) e [i§,oo) X [^,00): M < a;}, 

which is shghtly larger than P. Using e.g. the Mathematica command Reduce, 
one can see that on the set P the polynomials pi, p2, and p^ are positive, and 
so, Ai(x,w) and A2(a;,u) are equal in sign to {u — l)^{x,u) and ^^(a;,u), 
respectively, for all points [x^u) g P with u ^ 1; note here that u < x < x"^ for 
aU {x,u) e P. 

Note also that A2(x,-u) is a polynomial in {x,u) (of degree 24 in a;, and 
14 in u). Using again Reduce, one finds that A2 > on P. It follows that 
Ai(a;, u) increases in w G [^, 1) and in u £ (1, a;] for each x £ [j^, 00). Moreover, 
Ai(a;,a;— ) = — ^ < 0, whence Ai(a;,u) < for all {x,u) £ P such that m > 1. 

On the other hand, one has 

Sublemma 5.1. Ai, := Ai*(2:) := Ai(a;, ^^j) > for all x £ [^§,00). 

We shall prove this a bit later in this section. Since Ai{x,u) increases in u, 
it follows that Ai{x,u) > for all {x,u) £ P such that u < I. Recalling that 



Ai(x,u) equals (u — l)^(a:;,u) in sign, we conclude that A(a;, u) decreases in 
u £ {jq,x] for each x £ [y|,cxd). Therefore and because u* > j^, it remains to 



prove 

Sublemma 5.2. A*(x) ;= A(a;, m*) < for all x £ [^, 00) 

— as well as Sublemma 5.1. 

Proof of Sublemma 5.1. The derivative A']^^(x) of Ai*(a;) is of the form 
Ri{x)e^'^^^' , where Ri{x) is a certain rational expression in x. It follows (using 
again Mathematica, say), that A[^{x) ^ iff x ^ xi*, where xi, = 3.62 ... is 
the only root in the interval [y|,oo) of a certain polynomial (of degree 20). So, 
the minimum of Ai* on [j^, 00) is attained at the point xi*. Using the Mathe- 
matica command Reduce again, one finds that this minimum value is positive, 
which proves the sublemma. D 
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Proof of Sublemma 5.2. Introduce (cf. Ai*(x') in Sublcmma 5.l) 

A*i(a;) := 



2tt dx \ h{x 

3814697265625 r 51 {SQlx'^ + 51) ^ {x"^ ~ u^) p^^x^ 

^ 51 ^^^ 1 31250a;2 - 5202 i xjhJx) ' ^'' 

dA,i(a:) 

A^2[x) ■■= 

da; 

95367431640625 r 12750a;2 ^{x^~ulfp2*{xfpA*{xf 

^ 1224 ^^^ I ~ 15625x2-2601/ xp^^{x) ' 

where (recall (5.1)) 

P2*{x) :=p2(a;, u*), P3*(a;) :== _P3(a;, "*), 
PA*{x) := -184559856669+ 1289843642871a;2 + 244896587625x* 

+ 85828328125x^. 

Note that on the interval [y^,oo) the polynoinials p2*, P3*, Pa* are positive, 
and so, A*i(a;) and A*2(a;) are equal in sign to -^ ( "FTT^ ) ^'^"^ — 'dx ' '^®~ 
spectively, for all x € [y|,(X)). Moreover, A*2 is a polynomial (of degree 20), 
which is negative on the interval [y^,oo), so that A»i decreases on this in- 
terval. Next, A*i(f|) = 0.0042... > while A,i(4) < 0. So, the maximum 
of A,//i on the interval [y|,oo) is attained at some point between H and 4; 
it also follows that the maximum of A«i on the interval [f§,4] is less than 

0.0043. On the other hand, ^ -£ (^^) = A*i(a;)Af (cc), where, in view 
of (5.2), M{x) is the product of two positive expressions, one of which is 

exp I — 31250 ^-5202 } ^^^ ^^^ othcr is a certain rational expression. Using the 
Mathematica command Maximize allows one to find the exact maximum of 
M on [f§,4], which is attained at ^ and is less than 0.00552; so, the maxi- 
mum of -^ -£ (^^) in a; e [f§,4] is less than 0.0043 x 0.00552 < ■^^. 
Thus, the maximum of -^A»//i on the interval [y^,oo) is no greater than 
;^ A,(f§)//i(f§) + ^oQ (4 - f§) = -0.000018... < 0. This completes the 
proof of Sublcmma 5.2. D 
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